A lower score means a better match. As expected, a survey of all heads in the model shows that head 1.5 is an outlier: its score is far lower than that of any other head, which supports the view that it’s specialized for this task.
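The survey described above can be sketched as a search over a table of per-head scores. This is a minimal illustration, not the code used to produce the result. It assumes the scores are already collected in a 2-D array indexed as `scores[layer, head]`, and that "head 1.5" means layer 1, head 5. The function name `find_outlier_head` and the demo values are hypothetical placeholders.

```python
import numpy as np

def find_outlier_head(scores):
    """Return (layer, head, z) for the lowest-scoring head.

    scores: 2-D array with scores[layer, head]; lower means a better match.
    z: how many standard deviations the minimum lies from the mean of
       all *other* heads (negative means below the mean).
    """
    scores = np.asarray(scores, dtype=float)
    flat_idx = int(np.argmin(scores))
    layer, head = np.unravel_index(flat_idx, scores.shape)
    # Compare the best head against the rest, excluding itself
    others = np.delete(scores.ravel(), flat_idx)
    z = (scores[layer, head] - others.mean()) / others.std()
    return int(layer), int(head), float(z)

# Placeholder values for illustration only, NOT measurements from the model:
# 2 layers x 8 heads with scores near 1.0, and one artificially low entry.
demo = np.full((2, 8), 1.0)
demo += np.linspace(-0.05, 0.05, 16).reshape(2, 8)
demo[1, 5] = 0.1

layer, head, z = find_outlier_head(demo)
print(f"lowest score at head {layer}.{head}, z = {z:.1f}")
```

A strongly negative `z` for a single head, with every other head clustered together, is the pattern the survey describes. The z-score is just one simple way to quantify "outlier"; a histogram of all scores would show the same thing visually.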